|
|
Accession Number |
TCMCG075C26297 |
gbkey |
CDS |
Protein Id |
XP_017983725.1 |
Location |
join(6621560..6621731,6622122..6622204,6622628..6622722,6623718..6623821,6624229..6624398,6624526..6624640,6625080..6625135,6625224..6625262,6625433..6625531,6625806..6625868,6625987..6626049,6626163..6626245,6626418..6626526,6626626..6626679,6627248..6627335,6627514..6627551,6630144..6630294,6631068..6631234,6631392..6631451) |
Gene |
LOC18588725 |
GeneID |
18588725 |
Organism |
Theobroma cacao |
|
|
Length |
602aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018128236.1
|
Definition |
PREDICTED: imidazole glycerol phosphate synthase hisHF, chloroplastic isoform X2 [Theobroma cacao] |
CDS: ATGGAGGGGGTGCCATATGCTTACACTACAAGCTTCAAAACACAATTATTTTTGTCGTCTGCACTGTCATCATCGTCTATTATAACCATCCACCAAAAGCGTCACAAAACTATTTTAAAATCCATATCTCGTAGAAATCTTGTTATCTGTGCTTCATCTGGTTCTAGTTCTGTTGTGAAGTTGCTTGATTATGGAGCTGGAAATGTTCGGAGCTTAAGGAATGCTATTCACCATCTTGGCTTTGAGATAGAGGATGTGCAAACTCCAAAAGACATTTTGGATGCTGAACGCCTTATCTTTCCTGGTGTTGGGGCATTTGCTTCAGCCATGGATGTATTGGTCAAGACCGGGATGGCTGACGCACTTTGTTCCTATATCAAGAATGATCGCCCATTTCTAGGCATTTGTCTTGGCCTTCAACTACTTTTTGAGTCTAGTGAAGAGAATGGACCAGTGAATGGTCTAGGCTTGATACCTGGTGTGGTTGGGCGGTTTAACTCTTCAAATGGTTTTAGAGTACCCCATATTGGCTGGAATGCTTTGCAAATTACAAAAGACTCTGAAATTTTGGATGACATTGGAGATCACCATGTCTACTTTGTTCACTCTTACCGTGCCATGCCATCAGATGATAACAAGGAATGGATTTCATCTACATGCAATTATGGTGATGATTTTATAGCGTCTATCAGAAGGGGAAATGTGCATGCAGTTCAGTTCCATCCAGAGAAGAGTGGAGATGTTGGTCTTTCTGTATTGAGAAGGTTTCTAGATCCAAAGTCACAGGGGACAAAGAATCTTACTCAGGGGAAGGCTTCAAAACTTGCTAAGAGGGTGATTGCTTGTCTTGATGTTAGGACGAATGATAAGGGGGATCTTGTTGTCACCAAAGGGGACCAGTATGATGTACGAGAGCACACAAAAGAGAATGAGGTGAGAAACCTTGGCAAACCTGTGGAGCTTGCTGGACAGTATTACAAAGATGGGGCTGATGAGGTCAGTTTTTTGAACATTACTGGCTTCCGTGACTTCCCATTAGGCGATTTACCAATGTTGCAGGTATTAAGACGCACTTCAGAGAATGTTTTTGTCCCACTAACGGTCGGAGGTGGTATACGAGATTTTACAGATGCAAATGGCAGGCACTATTCTAGTTTGGAGGTTGCTTCAGAGTACTTTAGGTCTGGGGCTGATAAAATTTCCATTGGGAGTGATGCAGTTCATGCAGCAGAAGAATATATGAAAACCAAAGTAAAGACAGGAAAGAGCAGCTTAGAACAAATTTCTAAAGTCTATGGAAATCAGGCAGTAGTTGTAAGCATTGATCCTCGTAGAGTGTACCTTAAAAGTCCTAATGATGTGCAGTTCAAGACCATAAGGGTCACAAAACCAGGTCCAAGTGGAGAAGAATATGCTTGGTATCAGTGTACGAAGTCTTTATCTTATGCACATCCTGAATGGCTTTCTGTTCCTAAGGTTAATGGTGGGCGTGAAGGTCGACCAATTGGGGCTTATGAGCTTGCAAAAGTAGTTGAAGAACTGGGAGCTGGAGAAATACTATTGAACTGCATTGATTGTGATGGTCAAGGAAAAGGATTTGATATAGATTTAATAAAGCTGATATCAGATGCTGTCAGCATCCCTGTAATTGCAAGTAGTGGTGCCGGTGCTGTTGAACACTTCTCGGAGGTATTCATGAAAACAAATGCATCAGCAGCTCTTGCTGCTGGCATTTTCCATCGGAAGGAGGTGCCCATTCAGTCTGTAAAAGAACACTTGTCGAAGGAAGGCATTGAAGTAAGGATATAG |
Protein: MEGVPYAYTTSFKTQLFLSSALSSSSIITIHQKRHKTILKSISRRNLVICASSGSSSVVKLLDYGAGNVRSLRNAIHHLGFEIEDVQTPKDILDAERLIFPGVGAFASAMDVLVKTGMADALCSYIKNDRPFLGICLGLQLLFESSEENGPVNGLGLIPGVVGRFNSSNGFRVPHIGWNALQITKDSEILDDIGDHHVYFVHSYRAMPSDDNKEWISSTCNYGDDFIASIRRGNVHAVQFHPEKSGDVGLSVLRRFLDPKSQGTKNLTQGKASKLAKRVIACLDVRTNDKGDLVVTKGDQYDVREHTKENEVRNLGKPVELAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRRTSENVFVPLTVGGGIRDFTDANGRHYSSLEVASEYFRSGADKISIGSDAVHAAEEYMKTKVKTGKSSLEQISKVYGNQAVVVSIDPRRVYLKSPNDVQFKTIRVTKPGPSGEEYAWYQCTKSLSYAHPEWLSVPKVNGGREGRPIGAYELAKVVEELGAGEILLNCIDCDGQGKGFDIDLIKLISDAVSIPVIASSGAGAVEHFSEVFMKTNASAALAAGIFHRKEVPIQSVKEHLSKEGIEVRI |